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hypermedia 
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In IR (information retrieval) systems based on the vector space model, the TF-IDF scheme 
is widely used to characterize documents. However, in the case of documents with 
hyperlink structures such as Web pages, it is necessary to develop a technique for 
representing the contents of Web pages more accurately by exploiting the contents of their 
hyperlinked neighboring pages. In this paper, we first propose several approaches to 
refining the TF-IDF scheme for a target Web page by using the contents ... 

Keywords: TF-IDF scheme, WWW, hyperlink, information retrieval 
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Ashwin R. Bharambe, Mukesh Agrawal, Srinivasan Seshan 

August 2004 ACM SIGCOMM Computer Communication Review , Proceedings of the 
2004 conference on Applications, technologies, architectures, and 
protocols for computer communications, volume 34 issue 4 

Additional Information: full citation , abstract , references , citings , index 
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This paper presents the design of Mercury, a scalable protocol for supporting multi-attribute 
range-based searches. Mercury differs from previous range-based query systems in that it 
supports multiple attributes as well as performs explicit load balancing. To guarantee 
efficient routing and load balancing, Mercury uses novel light-weight sampling mechanisms 
for uniformly sampling random nodes in a highly dynamic overlay network. Our evaluation 
shows that Mercury is able to achieve ... 

Keywords: distributed hash tables, load balancing, peer-to-peer systems, random 
sampling, range queries 
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Ahmed Helmy 
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Full text available: ^ pdf(844.09 KB) Additional Information: full citation , abstract , references 

Queries and small transfers are likely to constitute a significant portion of the flows in 
emerging classes of sensor networks. Route discovery for such queries incurs much more 
communication overhead than the actual data transfer. Especially for large-scale sensor 
networks, it is quite costly to establish shortest path routes for such types of requests. 
Flooding-based approaches for routing are designed to search for high quality routes. Such 
approaches may be suitable for prolonged transfers, ... 

Nearest neighbor queries in road networks 

Christian S. Jensen, Jan Kolarvr, Torben Bach Pedersen, IgorTimko 

November 2003 Proceedings of the 11th ACM international symposium on Advances in 
geographic information systems 

Additional Information: full citation , abstract , references , citings , index 
terms 

With wireless communications and geo-positioning being widely available, it becomes 
possible to offer new e-services that provide mobile users with information about other 
mobile objects. This paper concerns active, ordered k-nearest neighbor queries for query 
and data objects that are moving in road networks. Such queries may be of use in many 
services.Specifically, we present an easily implementable data model that serves well as a 
foundation for such queries. We also pres ... 

Keywords: location-based services, nearest neighbors, query processing, road networks 
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June 1984 ACM Computing Surveys (CSUR), volume 16 issue 2 
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6 Special issue on wireless extensions to the internet: Scenario-based comparison of 
source-tracing and dynamic source routing protocols for ad hoc networks 
Jyoti Raju, J.J. Garcia-Luna-Aceves 

October 2001 ACM SIGCOMM Computer Communication Review, volume 31 issue 5 
Full text available: ^ pdfd.OO MB) Additional Information: full citation , abstract , references 

We present source tracing as a new viable approach to routing in ad hoc networks in which 
routers communicate the second-to-last hop and distance in preferred paths to 
destinations. We introduce a table-driven protocol (BEST) in which routers maintain routing 
information for all destinations, and an on-demand routing protocol (DST) in which routers 
maintain routing information for only those destinations to whom they need to forward 
data. Simulation experiments are used to compare these proto ... 

Keywords: On-demand routing, ad hoc networks, wireless routing 
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November 2003 Proceedings of the twelfth international conference on Information and 
knowledge management 

Additional Information: full citation , abstract , references , citings , index 



Full text available: r ._ 
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It is known that all multi-dimensional index structures fail to accelerate content-based 
similarity searches when the feature vectors describing images are high-dimensional. It is 
possible to circumvent this problem by relying on approximate search-schemes trading-off 
result quality for reduced query execution time. Most approximate schemes, however, 
provide none or only complex control on the precision of the searches, especially when 
retrieving the k nearest neighbors (NNs) of query poi ... 

Keywords: approximate nearest-neighbor searches, multimedia databases, similarity 
searches 



8 Sensor databases: The design of an acquisitional query processor for sensor networks 
Samuel Madden, Michael J. Franklin, Joseph M. Hellerstein, Wei Hong 
June 2003 Proceedings of the 2003 ACM SIGMOD international conference on 
Management of data 
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We discuss the design of an acquisitional query processor for data collection in sensor 
networks. Acquisitional issues are those that pertain to where, when, and how often data is 
physically acquired (sampled) and delivered to query processing operators/By focusing on 
the locations and costs of acquiring data, we are able to significantly reduce power 
consumption over traditional passive systems that assume the a priori existence of data. 
We discuss simple extensions to SQL for co ... 

On semantic caching and query scheduling for mobile nearest-neighbor search 
Baihua Zheng, Wang-Chien Lee, Dik Lun Lee 
November 2004 Wireless Networks, Volume 10 issue 6 

Full text available: ^ pdf(293.57 KB) Additional Information: full citation , abstract , references , index terms 

Location-based services have received increasing attention in recent years. In this paper, 
we address the performance issues of mobile nearest-neighbor search, in which the mobile 
user issues a query to retrieve stationary service objects nearest to him/her. An index 
based on Voronoi Diagram is used in the server to support such a search, while a semantic 
cache is proposed to enhance the access efficiency of the service. Cache replacement 
policies tailored for the proposed semantic cache are ... 

Keywords: Voronoi diagram, indexing technique, location-based services, nearest- 
neighbor search, query scheduling, roaming, semantic caching 
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June 2004 Proceedings of the 7th International Workshop on the Web and Databases: 
colocated with ACM SIGMOD/ PODS 2004 

Full text available: ^ pdf(208.01 KB) Additional Information: full citation , abstract , references 

Peer-to-peer systems enable access to data spread over an extremely large number of 
machines. Most P2P systems support only simple lookup queries. However, many new 
applications, such as P2P photo sharing and massively multi-player games, would benefit 
greatly from support for multidimensional range queries. We show how such queries may 
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be supported in a P2P system by adapting traditional spatial-database technologies with 
novel P2P routing networks and load-balancing algorithms. We show how to ... 



11 An efficient algorithm for link-distance problems 
Y. Ke 

June 1989 Proceedings of the fifth annual symposium on Computational geometry 

Full text available- f£| pdf(1.08 MB) Additional Information: full citation, abstract , references , citings, index 

The link distance between two points inside a simple polygon P is defined to be the 
minimum number of edges required to form a polygonal path inside P that connects the 
points. A link furthest neighbor of a point p &Egr; P is a point of P whose link distance is the 
maximum from p. The link center of P is the collection of p ... 

12 Energy efficiency: Localized algorithms for energy efficient topology in wireless ad hoc 
networks 

Wen-Zhan Song, Yu Wang, Xiang-Yang Li 

May 2004 Proceedings of the 5th ACM international symposium on Mobile ad hoc 
networking and computing 

Full text available- t B|pdff207.01 KB) Additional Information: full citation , abstract, references , citings, index 

terms 

We propose several novel localized algorithms to construct energy efficient routing 
structures for homogeneous wireless ad hoc networks, where all nodes have same 
maximum transmission ranges. Our first structure has the following attractive properties: 
(1) It is energy efficient: given any two nodes u and v, there is a path connecting them in 
the structure with total energy cost at most p = l/l-(2sin n/k) p times of the energy cost of 
any path connecting th ... 

Keywords: bounded degree, planar, spanner, topology control, wireless ad hoc networks 



13 Processing in-route nearest neighbor queries: a comparison of alternative approaches ||£ 
Shashi Shekhar, Jin Soung Yoo 

November 2003 Proceedings of the 11th ACM international symposium on Advances in 
geographic information systems 

Full text available: "jf^ pdf(440.85 KB) Additional Information: full citation , abstract , references , index terms 

Nearest neighbor query is one of the most important operations in spatial databases and 
their application domains, e.g., location-based services, advanced traveler information 
systems, etc. This paper addresses the problem of finding the in-route nearest neighbor 
(IRNN) for a query object tuple which consists of a given route with a destination and a 
current location on it. The IRNN is a facility instance via which the detour from the original 
route on the way to the destination is smallest. Thi ... 

Keywords: advanced traveler information systems, location-based services, nearest 
neighborhood query, road network, route 



14 Special section: Reasoning about structure, behavior and function 
B. Chandrasekaran, Rob Milne 
July 1985 ACM SIGART Bulletin, issue 93 

Full text available: ^ |pdf(5.13 MB) Additional Information: full citation , abstract , references 

The last several years' of work in the area of knowledge-based systems has resulted in a 
deeper understanding of the potentials of the current generation of ideas, but more 
importantly, also about their limitations and the need for research both in a broader 
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framework as well as in new directions. The following ideas seem to us to be worthy of note 
in this connection. 

15 Historical queries along multiple lines of time evolution 
Gad M. Landau, Jeanette P. Schmidt, Vassilis J. Tsotras 

October 1995 The VLDB Journal — The International Journal on Very Large Data Bases, 

Volume 4 Issue 4 

Full text available: " ^pdfM.41 MB) Additional Information: full citation , abstract , references , citings 

Traditional approaches to addressing historical queries assume a single line of time 
evolution; that is, a system (database, relation) evolves over time through a sequence of 
transactions. Each transaction always applies to the unique, current state of the system, 
resulting in a new current state. There are, however, complex applications where the 
system's state evolves into multiple lines of evolution. In general, this creates a tree 
(hierarchy) of evolution lines, where each tree ... 

Keywords: CAD databases, access methods, data-structures, rollback databases 



16 Technical papers: mobile and distributed computing: Network abstractions for context- 
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Gruia-Catalin Roman, Christine Julien, Qingfend Huang 

May 2002 Proceedings of the 24th International Conference on Software Engineering 

Additional Information: full citation , abstract , references , citings , index 



Full text available: ■ ^ , _ = ^ 

terms 

Context-aware computing is characterized by the ability of a software system to 
continuously adapt its behavior to a changing environment over which it has little or no 
control. Previous work along these lines presumed a rather narrow definition of context, one 
that was centered on resources immediately available to the component in question, e.g., 
communication bandwidth, physical location, etc. This paper explores context-aware 
computing in the setting of ad hoc networks consisting of numerous ... 

17 On-demand multicast routing protocol in multihop wireless mobile networks | 
Sung Ju Lee, William Su, Mario Gerla 

December 2002 Mobile Networks and Applications, volume 7 issue 6 

Additional Information: full citation , abstract , references , citings , index 



Full text available: . 

terms 

An ad hoc network is a dynamically reconfigurable wireless network with no fixed 
infrastructure or central administration. Each host is mobile and must act as a router. 
Routing and multicasting protocols in ad hoc networks are faced with the challenge of 
delivering data to destinations through multihop routes in the presence of node movements 
and topology changes. This paper presents the On-Demand Multicast Routing Protocol 
(ODMRP) for wireless mobile and hoc networks. ODMRP is a mesh-based, rat ... 

Keywords: ad hoc networks, mobile computing, multicast, routing 
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J. J. Garcia-Luna-Aceves 

August 1989 ACM SIGCOMM Computer Communication Review , Symposium 
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We present a unified approach for the dynamic computation of shortest paths in a computer 
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network using either distance vectors or link states. We describe a distributed algorithm 
that provides loop-free paths at every instant and extends or improves algorithms 
introduced previously by Chandy and Misra, Jaffe and Moss, Merlin and Segall, and the 
author. Our approach treats the problem of distributed shortest-path routing as one of 
diffusing computations, which was first proposed by Dijkstra ... 

19 Peer-to-peer computing: Foreseer: a novel locality-aware peer-to-peer system 

architecture for keyword searches 
Hailong Cai, Jun Wang 

October 2004 Proceedings of the 5th ACM/IFIP/USENIX international conference on 



Full text available: ^ pdf(315.85 KB) Additional Information: full citation , abstract , references 

Peer-to-peer (P2P) systems are becoming increasingly popular and complex, serving 
millions of users today. However, the design of current unstructured P2P systems does not 
take full advantage of rich locality properties present in P2P system workloads, thus 
possibly resulting in inefficient searches or poor system scalability. In this paper, we 
propose a novel locality-aware P2P system architecture called Foreseer, which explicitly 
exploits <I>geographical</I> locality and <I>t ... 

Keywords: Bloom filter, Foreseer, geographical locality, temporal locality 



20 An optimal algorithm for approximate nearest neighbor searching fixed dimensions 
Sunil Arya, David M. Mount, Nathan S. Netanyahu, Ruth Silverman, Angela Y. Wu 
November 1998 Journal of the ACM (JACM), volume 45 issue 6 



Consider a set of S of n data points in real d-dimensional space, Rd, where distances are 
measured using any Minkowski metric. In nearest neighbor searching, we preprocess S into 
a data structure, so that given any query point q e Rd, is the closest point of S to q can be 
reported quickly. Given any po ... 

Keywords: approximation algorithms, box-decomposition trees, closet-point queries, 
nearest neighbor searching, post-office problem, priority search 
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1 Peer-to-peer computing: Foreseer: a novel, locality-aware peer-to-peer system 

architecture for keyword searches 
Hailong Cai, Jun Wang 

October 2004 Proceedings of the 5th ACM/IFIP/USENIX international conference on 
Middleware 

Full text available: ^ pdf(315.85 KB) Additional Information: full citation , abstract , references 

Peer-to-peer (P2P) systems are becoming increasingly popular and complex, serving 
millions of users today. However, the design of current unstructured P2P systems does not 
take full advantage of rich locality properties present in P2P system workloads, thus 
possibly resulting in inefficient searches or poor system scalability. In this paper, we 
propose a novel locality-aware P2P system architecture called Foreseer, which explicitly 
exploits <I>geographical</I> locality and <I>t ... 



Keywords: Bloom filter, Foreseer, geographical locality, temporal locality 



2 Paper session 2: peer-to-peer search systems: One torus to rule them all: multi- 
dimensional queries in P2P systems 

Prasanna Ganesan, Beverly Yang, Hector Garcia-Molina 

June 2004 Proceedings of the 7th International Workshop on the Web and Databases: 
colocated with ACM SIGMOD/PODS 2004 

Full text available: pdf(208.01 KB) Additional Information: full citation , abstract , references 

Peer-to-peer systems enable access to data spread over an extremely large number of 
machines. Most P2P systems support only simple lookup queries. However, many new 
applications, such as P2P photo sharing and massively multi-player games, would benefit 
greatly from support for multidimensional range queries. We show how such queries may 
be supported in a P2P system by adapting traditional spatial-database technologies with 
novel P2P routing networks and load-balancing algorithms. We show how to ... 

3 Special issue on wireless pan & sensor networks: CAPTURE: location-free contact- 
assisted power-efficient query resolution for sensor networks 

Ahmed Helmy 

January 2004 ACM SIGMOBILE Mobile Computing and Communications Review, volume 8 
Issue 1 
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Queries and small transfers are likely to constitute a significant portion of the flows in 
emerging classes of sensor networks. Route discovery for such queries incurs much more 
communication overhead than the actual data transfer. Especially for large-scale sensor 
networks, it is quite costly to establish shortest path routes for such types of requests. 
Flooding-based approaches for routing are designed to search for high quality routes. Such 
approaches may be suitable for prolonged transfers, ... 

4 Session 2: Distributed object location in a dynamic network 
Kirsten Hildrum, John D. Kubiatowicz, Satish Rao, Ben Y. Zhao 

August 2002 Proceedings of the fourteenth annual ACM symposium on Parallel 
algorithms and architectures 

Full text available: f»W(234.76 KB) Additional Information: full citation , attract, references , citings, index 
LLii "^ terms 

Modern networking applications replicate data and services widely, leading to a need for 
location-independent routing - the ability to route queries directly to objects using names 
independent of the objects' physical locations. Two important properties of a routing 
infrastructure are routing locality and rapid adaptation to arriving and departing nodes. We 
show how these two properties can be efficiently achieved for certain network topologies. 
To do this, we present a new ... 

Keywords: DHT, DOLR, distributed hash table, distributed object location, locality, nearest 
neighbor, networking, overlay, peer-to-peer, tapestry 
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Ashwin R. Bharambe, Mukesh Agrawal, Srinivasan Seshan 

August 2004 ACM SIGCOMM Computer Communication Review , Proceedings of the 
2004 conference on Applications, technologies, architectures, and 
protocols for computer communications, volume 34 issue 4 

Additional Information: full citation , abstract , references , citings , index 
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This paper presents the design of Mercury, a scalable protocol for supporting multi-attribute 
range-based searches. Mercury differs from previous range-based query systems in that it 
supports multiple attributes as well as performs explicit load balancing. To guarantee 
efficient routing and load balancing, Mercury uses novel light-weight sampling mechanisms 
for uniformly sampling random nodes in a highly dynamic overlay network. Our evaluation 
shows that Mercury is able to achieve ... 

Keywords: distributed hash tables, load balancing, peer-to-peer systems, random 
sampling, range queries 
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Jochen Behrens, J. J. Garcia-Luna-Aceves 

October 1994 ACM SIGCOMM Computer Communication Review , Proceedings of the 
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Volume 24 Issue 4 

r- ll4 A i b£o^ mdv Additional Information: full citation , abstract , references , citings , index 

Full text available: t?.1 pdf(1.42 MB) : 
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A new family of routing algorithms for the distributed maintenance of routing information in 
large networks and internets is introduced. This family is called link vector algorithms 
(LVA), and is based on the selective diffusion of link-state information based on the 
distributed computation of preferred paths, rather than on the flooding of complete link- 
state information based on the distributed computation of preferred paths, rather than on 
the flooding of complete link-state information ... 
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Links for a better web: Refinement of TF-IDF schemes for web pages using their 
hyperlinked neighboring pages 

Kazunari Sugiyama, Kenji Hatano, Masatoshi Yoshikawa, Shunsuke Uemura 
August 2003 Proceedings of the fourteenth ACM conference on Hypertext and 
hypermedia 
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L - J "^ terms 

In IR (information retrieval) systems based on the vector space model, the TF-IDF scheme 
Is widely used to characterize documents. However, in the case of documents with 
hyperlink structures such as Web pages, it is necessary to develop a technique for 
representing the contents of Web pages more accurately by exploiting the contents of their 
hyperlinked neighboring pages. In this paper, we first propose several approaches to 
refining the TF-IDF scheme for a target Web page by using the contents ... 

Keywords: TF-IDF scheme, WWW, hyperlink, information retrieval 



8 A unified approach to loop-free routing using distance vectors or link states 
J. J. Garcia-Luna-Aceves 

August 1989 ACM SIGCOMM Computer Communication Review , Symposium 
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We present a unified approach for the dynamic computation of shortest paths in a computer 
network using either distance vectors or link states. We describe a distributed algorithm 
that provides loop-free paths at every instant and extends or improves algorithms 
introduced previously by Chandy and Misra, Jaffe and Moss, Merlin and Segall, and the 
author. Our approach treats the problem of distributed shortest-path routing as one of 
diffusing computations, which was first proposed by Dijkstra ... 

9 Peer to peer networks: Querv-flood DoS attacks in gnutella 
Neil Daswani, Hector Garcia-Molina 

November 2002 Proceedings of the 9th ACM conference on Computer and 

communications security 
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Full text available: T?;i pdf(465.15 KB) 
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We describe a simple but effective traffic model that can be used to understand the effects 
of denial-of-service (DoS) attacks based on query floods in Gnutella networks. We run 
simulations based on the model to analyze how different choices of network topology and 
application level load balancing policies can minimize the effect of these types of DoS 
attacks. In addition, we also study how damage caused by query floods is distributed 
throughout the network, and how application-level policies ca ... 

Keywords: denial-of-service, peer-to-peer, security 
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Peer-to-Peer networking has become a major research topic over the last few years. 
Sharing of structured data in such decentralized environments is a challenging problem, 
especially in the absence of a global schema. The standard practice of answering a query 
that is consecutively rewritten along the propagation path often results in significant loss of 
information. In this paper, we present an adaptive and bandwidth-efficient solution to the 
problem in the context of an unstructured, purely ... 

Keywords: peer-to-peer, query reformulation, structured data 
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Y. Ke 

June 1989 Proceedings of the fifth annual symposium on Computational geometry 

Full text available- fK lpdfM.08 MB) Additional Information: full citation , abstract, references , citings, index 
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The link distance between two points inside a simple polygon P is defined to be the 
minimum number of edges required to form a polygonal path inside P that connects the 
points. A link furthest neighbor of a point p &Egr; P is a point of P whose link distance is the 
maximum from p. The link center of P is the collection of p ... 

14 AGM: a dataflow database machine 
Lubomir Bic, Robert L. Hartmann 

March 1989 ACM Transactions on Database Systems (TODS), volume 14 issue l 

Full text available- fB| pdffZ69 MB) Additional Information: full citation , abstract, references , citings, index 
^ terms , review 

In recent years, a number of database machines consisting of large numbers of parallel 
processing elements have been proposed. Unfortunately, there are two main limitations in 
database processing that prevent a high degree of parallelism; these are the available I/O 
bandwidth of the underlying storage devices and the concurrency control mechanisms 
necessary to guarantee data integrity. The main problem with conventional approaches is 
the lack of a computational model capable of utilizing th ... 

15 Searching in metric spaces by spatial approximation 
Gonzalo Navarro 

August 2002 The VLDB Journal — The International Journal on Very Large Data Bases, 

Volume 11 Issue 1 

Full text available: W\ pdf(281.75 KB) Additional Information: full citation , abstract , citings , index terms 
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We propose a new data structure to search in metric spaces. A metric space is formed by a 
collection of objects and a distance function defined among them which satisfies the triangle 
inequality. The goal is, given a set of objects and a query, retrieve those objects close 
enough to the query. The complexity measure is the number of distances computed to 
achieve this goal. Our data structure, called sa-tree ("spatial approximation tree"), is based 
on approaching ... 

Keywords: Multimedia databases, Similarity or proximity search, Spatial and 
multidimensional search, Spatial approximation tree 



16 Distributed information retrieval: SETS: search enhanced by topic segmentation 
Mayank Bawa, Gurmeet Singh Manku, Prabhakar Raghavan 

July 2003 Proceedings of the 26th annual international ACM SIGIR conference on 
Research and development in informaion retrieval 

Full text available* ^ lpdf(307.88 KB) Additional Information: full citation , abstract, references , citings, index 

terms 

We present SETS, an architecture for efficient search in peer-to-peer networks, building 
upon ideas drawn from machine learning and social network theory. The key idea is to 
arrange participating sites in a topic-segmented overlay topology in which most connections 
are short-distance, connecting pairs of sites with similar content. Topically focused sets of 
sites are then joined together into a single network by long-distance links. Queries are 
matched and ro ... 

Keywords: distributed information retrieval, peer-to-peer (P2P), small world networks, 
topic segments, topic-driven query routing 



17 Posters: Distributed ranking over peer-to-peer networks 
Dexter Chi Wai Siu, Tak Pang Lau 

May 2004 Proceedings of the 13th international World Wide Web conference on 
Alternate track papers & posters 

Full text available: ^ pdf(181.71 KB) Additional Information: full citation , abstract , references , index terms 

Query flooding is a problem existing in Peer-to-Peer networks like Gnutella. Firework Query 
Model solves this problem by Peer Clustering and routes the query message more 
intelligently. However, it still contains drawbacks like query flooding inside clusters. The 
condition can be improved if the query message can send directly to the query destination, 
as the message does not need to send hop by hop. This can be achieved by ranking. By 
ranking, the network can know the destination and the inform ... 

Keywords: distributed peer ranking, peer-to-peer networks 



18 Community search assistant 
Natalie S. Glance 

January 2001 Proceedings of the 6th international conference on Intelligent user 
interfaces 

Full text available: Qpdf(181.91 KB) Additional Information: full citation , abstract , references , index terms 

This paper describes a new software agent, the community search assistant, which 
recommends related searches to users of search engines. The community search assistant 
enables communities of users to search in a collaborative fashion. All queries submitted by 
the community are stored in the form of a graph. Links are made between queries that are 
found to be related. Users can peruse the network of related queries in an ordered way: 
following a path from a first cousin, to a second cousin t ... 
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19 Local versus global link information in the Web 

Pcivel Calado, Berthier Ribeiro-Neto, Nivio Ziviani, Edleno Moura, Ilmerio Silva 
January 2003 ACM Transactions on Information Systems (TOIS), volume 21 issue 1 

Full text available' W\ odf(41 3 06 KB) Add i tional Information: full citation , abstract , references , citings , index 
— : terms 

Information derived from the cross-references among the documents in a hyperlinked 
environment, usually referred to as link information, is considered important since it can be 
used to effectively improve document retrieval. Depending on the retrieval strategy, link 
information can be local or global. Local link information is derived from the set of 
documents returned as answers to the current user query. Global link information is derived 
from all the documents in the collection. In th ... 

Keywords: Belief networks, World Wide Web, link analysis, local and global information 



20 A survey of Web metrics 

Devanshu Dhyani, Wee Keong Ng, Sourav S. Bhowmick 

December 2002 ACM Computing Surveys (CSUR), volume 34 issue 4 

Full text available* Wi pdf(289 28 KB) Additional Information: full citation , abstract , references , citings , index 
• liyj-H—i terms 

The unabated growth and increasing significance of the World Wide Web has resulted in a 
flurry of research activity to improve its capacity for serving information more effectively. 
But at the heart of these efforts lie implicit assumptions about "quality" and "usefulness" of 
Web resources and services. This observation points towards measurements and models 
that quantify various attributes of web sites. The science of measuring all aspects of 
information, especially its storage and retrieval or ... 

Keywords: Information theoretic, PageRank, Web graph, Web metrics, Web page 
similarity, quality metrics 
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